AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Visual-Language Interaction

# Visual-Language Interaction

Qwen2 VL 7B Visual Rft Lisa IoU Reward
Apache-2.0
Qwen2-VL-7B-Instruct is a vision-language model based on the Qwen2 architecture, supporting multimodal input of images and text, suitable for various visual-language tasks.
Image-to-Text Safetensors English
Q
Zery
726
4
Chat Vector Llava V1.5 7b Ja
A visual-language model capable of conducting dialogues in Japanese about input images, created using the Chat Vector method by combining weights from multiple models
Image-to-Text Transformers Japanese
C
toshi456
26
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase